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Abstract 


This  document  is  the  fmal  report  for  ARPA  contract  MOA972-93-C-0044.  wtuch 
is  to  develop  an  ANS  (Adificial  Neural  System)  capable  of  modeling  sidxnarirte 
performstfice  based  on  full  scale  data  generated  u$ir>g  a  computer  based 
simulation  program. 

This  report  (ttscusses  the  project  backgrourxl.  induct  the  requirements  of  such 
a  simulation  model,  and  the  advantages  of  an  ANS  ^}proach.  The  (^vqiMness  of 
AWt‘8  Optimal-Entropy  Net^  Network  algorithm  »  discussed  as  weH  as  its 
significance  to  the  success  of  this  project.  AWi  has  developed  an  ANS  to  model 
submarine  performarKe  based  on  ttte  settmg  of  the  input  parameters  to  result  in 
a  pactloular  performarKe  for  the  submarkie  where  the  ANS  specifies  tfw  position 
arxf  ortentadon  of  the  submarine  sometime  in  the  future.  AWI  has  also 
developed  an  algorithm  to  run  the  ANS  in  the  inverse  mode,  namely  the 
algorithm  atiows  the  user  to  specify  the  desired  position  artd  orientatkx^  d  the 
submarine  at  some  time  in  the  kiture,  where  the  ANS  wM  then  specify  the 
controMng  ir^Hit  parameters  to  reach  that  specified  objective.  The  tatter 
specificatiorts  have  to  be  within  tite  realm  of  possible  requirements  or  else  the 
ANS  will  specify  a  poesibte  solution  dose  to  what  was  dMired.  The  developed 
ANS  is  capable  of  operating  in  a  PC  environment  The  results  are  obtained 
almoet  instantaneously  on  the  PC. 
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Introduction 

Th«  Devetopment  of  accurate  simutatk)n  modets  of  submam  behavior  has 
been  a  m^ler  of  vital  intere^  to  the  Navy  for  many  years.  Slmutation  models 
are  used  before  a  ship  is  built  to  evaluate  altemative  designs,  estimals  the 
submerged  operating  envelope  (SO£).  and  train  prospective  crews.  Subsequent 
to  ship  deployment,  simulation  models  are  used  to  continue  crew  training  and  to 
invesbgate  possible  arx)malous  behavior  observed  at  sea. 

Despite  this  long  inisrest  in  the  building  of  accurate  predictive  models,  the 
histi^  of  submarine  simulation  has  not  always  been  satisfying  in  terms  of  a 
basic  understandfog  of  foe  physical  phenomenon  underlying  observed  behavior. 
The  Arst  really  organized  approach  to  developing  equations  of  motion  for  U.S. 
subrraafoes  was  foe  now  famous  *2510*  r^xsrt  developed  by  Gerder  and  Hagen 
at  OTRC  In  1967.  These  equations  were  based  on  the  assumption  that  aft 
relevant  parameters  would  be  obtained  by  curve  fitting  foe  restits  of  static 
simple  osciliafory  measurements  tor  various  forces  and  moments  as  a  model  is 
moved  at  constant  sprMd.  As  a  result  foese  equations  are  tittle  more  than  a 
Taylor  series  expansion  of  foe  resulting  forces  arid  momants  with  certain  stohtfy 
(Afferent  forms  for  tiie  power  series  expansion.  For  example,  terms  in  X^are 
replaced  with  terms  in  xixl. 


In  foe  earty-70'e,  foe  Radfo  Controled  Model  (RCM)  was  developed  at  OTRC. 
This  provkied  a  slgnMcani  breaWhrough  In  tarma  of  modal  testing  because  the 
modal  was  no  longsr  artitidaly  constrained.  Even  more  Importantly,  actual  full- 
scale  maneuvers,  ouch  as  slam  plane  Jama,  could  be  atinulaiad.  Use  of  the 
RCM  permitted  dkect  predtetion  of  submarine  motions.  particUarly  during  high¬ 
speed  operations  foal  would  have  been  dMfIcull  or  dangerous  to  conduct  under 
foti-acale  oorKtitfons.  The  RCM  was,  and  is.  r^ipenalve  to  run  and  foere  are 
certain  timhatione  In  maneuvering  potential  because  of  the  size  of  the  MASK 
(Maneuver  and  Sea  Keeping  basin)  at  OTRC. 

In  the  late  itiTtrs.  foe  potential  for  foe  use  of  parameter  estimation  and  system 
identification  (SI),  which  grew  out  ^  optimal  oonirot  foeory,  was  explored.  Si 
tecfoniquea  were  developed  at  OTRC  and  to  fols  day  a  combination  of  fixed 
mooar  laaai.  nciw.  ano  sr  are  vie  masisaqfs  or  avnuHnon  oevenpmem  or 
submarine  motions  in  tie  U.S.  Navy. 


However,  aa  important  and  useM  as  foe  lechnlquoe  dfocuaeed  above  have 
been,  foe  reeufla  do  not  contribute  to  an  understandfog  of  9ia  hydrodynamics  of 
submarine  behavior,  in  addttlon.  SI  apptication  remalna  more  of  an  *ait  form*  as 
opposed  to  a  sdentitie  mefoodology.  Furiher,  foe  early  promise  tiiat  SI  would 
lead  not  only  to  a  more  predae  method  of  'curve  fllti^,  but  to  inaights  into 
hydraulic  responses  remalna  unfUtiWed.  As  an  example,  ttie  observed 
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pitch/depth  osdNalions  occur  in  the  OHIO  Class  st^>marin^  during  high 
speed  turns  a  simuiidsd  by  turning  on  a  sine  wave  pitch  and  deptt)  (fis&ift>ance. 

Current  research  efforts,  primarily  sponsored  by  ARPA's  Sub  Tech  program 
have  lead  to  the  establishment  of  a  Computational  Ruid  Dynamics  Laboratory  at 
OTRC.  This  biboralory  has.  as  one  of  its  primary  goals,  the  development  of 
methodologies  which  wil  perinit  the  prediction  of  a  subrruririe  perfonna^  from 
basic  design  information.  Attainment  of  tutt  goal  is  stM  in  the  future  and 
currently  it  continues  to  be  necessary  to  utilize  RCM.  fixed-model  testing,  and  SI 
methodologies  despite  their  Hrnitatiorrs. 

Artificial  Neural  System  (ANS)  has  proven  to  be  a  suoceiwful  approach  to  real¬ 
time  process  modsing.  However,  traditional  ANS  training  algorithms,  such  as 
bacfc-propagatlon  are  time  consumirrg.  Learning  instabilities  and  the 
determination  of  the  optimal  net  conliguraiion  are  other  time  consuming 
problems  associalsd  with  tradMonal  trainirH)  aigortthms. 


Adifmtiagm  of  th9  AWTb  AMS 

Compared  with  conventional  modeing  techniques,  ire  AWl  ANS  modeling 
approach  has  the  folowtng  advantages: 

•  Empirically  baaed.  The  ANS  modal  reties  solely  on  ei^erimenW  or 
numerical  data  which  Is  used  as  the  training  data  for  the  ANS  model. 

•  Real  time.  The  ANS  model.  In  tie  forward  mods,  is  capable  of  nmning  in 
real  time,  even  on  an  ordtiiary  MS<WS  based  machine. 

•  Near  real  time  inverse.  Not  only  can  tie  ANS  model  prstict  tie  Mure 
subrnarine  tra|ectory  whan  gl^  tia  cunem  oondWon  and  oontrol 
variablae.  such  as  tia  position  of  niddsr.  stem  planss  and  fomvard 
planae.  but  AWTs  ANS  also  can  dsfwmina  tia  nscastsry  control 
vartabias  to  lead  tie  submarine  to  sn  opsruor  spsdtiad  position  st  some 
Mure  time,  in  tie  inverse  mode  tie  ANS  is  cspeWs  of  operating  in  near- 
real  lima  on  an  MS-DOS  based  machins. 

To  ovarcoma  tia  problima  sasodaNd  wtti  tit  taittionai  ANS  irMng 
atgormna,  awi  oavaiopao  a  propnatary  wvng  aiguiWMti  canao  tia  oporidzad 
Entropy  algorWhm.  TTtia  algoritim  oonaldari  tia  daiti  arror  (tia  maan  arror  of 
tia  nalK  at  wal  aa  tia  "Entropy.*  as  tia  driving  foroaa  of  tia  nat  In  traditional 
training  algortihma  tia  maan  arror  Is  tie  only  driving  force;  tils  ntis  is  cstisd  tie 
"Deni  fMs.*  AWl  resaarchsri  have  found  tiai  tiara  is  another  driving  force 
tiat  we  can  "Entiopy.*  The  entropy,  or  lech  tiereof.  Is  tie  ms|or  reason  for 
learning  InitabtiWas.  such  as  local  minimum.  This  is  aoosnhMisd  vriisn  learning 
rafss  are  maxfmiied.  By  using  tie  Optimized  Ertiropy  rule.  AWl  has  reduced  tie 
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tim9  required  for  the  rwt  to  converge,  and  ir^reased  the  effiderv^y  of  the  net 
through  the  eiifnination  d  uneesenlial  nodes  and  links. 

The  unique  si^ficance  of  this  system  is  the  use  of  a  new  neural  network 
trainmg  algorithm,  caited  the  *Oplimi2ed  Entropy^  algorithm,  to  train  the  neural 
network.  This  algorithm  can  acoBlerate  the  network  learning  process,  overcome 
learning  mstabitttwe.  and  optimize  ^  net  configurabon.  The  Optimized  Entropy 
algorithm  enables  the  solutions  to  difficuit  problems  on  a  desktop  computer 
within  an  acceptable  time  frame. 


Ob^iectfvwe  for  ffi#  eurrenf  wort 

The  objective  for  this  project  is  to  show  the  feas^ity  of  usmg  a  PC  based  ANS 
to  model  the  submarine  trajectory  as  the  function  of  submarine  speed,  the 
poei^  of  the  rudder,  tte  stem  pianM,  and  fonvard  planes  ar)d  the  control 
planes’  holding  time.  The  training  data  used  in  this  project  is  obtamed  from  a 
computer  based  simulation  program  supplied  by  Dr.  John  Ware  at  Computer 
Sciences  Corporation  (CSC).  There  are  two  reasons  to  use  simulated  data 
instead  of  actual  oper^riioruri  data.  Rrst.  the  simulated  data  is  unclassified. 
Second,  if  the  ANS  can  simulals  the  simulation  program,  it  is  reasonable  to 
believe  that  the  ANS  aieo  can  simulate  the  operational  data.  The  ability  to  model 
the  simulated  data  wil  prove  the  feasibikty  of  using  the  PC  based  ANS.  and 
therefore  satisfy  the  objective  of  9m  program. 

Artificial  Ntunl  Systamg 

Arafldol  Neural  Systame  (ANS)  are  looeafy  based  on  biological  neural  networks 
end  offer  e  computer  technology  that  le  a  ueaM  tool  in  prooaes  modeling  and 
signal  proceeeing.  The  ANS  coneleri  of  a  sartae  of  nodes  (neurons)  and 
weighted  cormectione  (axone).  Aa  with  a  biologicei  neural  network,  the 
eseignment  of  the  valuee  of  tie  welghte  end  tie  size  and  configuration  of  the 
network  it  tie  key  to  a  luccaieki  net  UntortunaUify.  we  have  only  begun  to 
underaamd  tie  inner  workingi  of  tieea  conetructa.  Consequently,  reiailvely 
unaopNstlcaied  loole  are  currenty  employad  to  develop  working  networks. 

Traditionally  tie  approach  to  modal  devaiopmant  ie  to  gain  a  thorough 
undarstancfing  of  tie  undertying  edentlk.  or  engineering  prkidpias  of  tie 
proceea.  A  modelie  tien  developed  tiet  ie  baaed  on  methemaficil  retaBooehips 
jnharent  In  tie  proceea  perimeteri.  The  prtndpei  draetoack  witi  tts  approach 
ie  tie  fima  and  eltort  requked  to  deval^  an  underatanrang  of  tieea  basic 
•twiuilc  Or  ifiginMnnQ  rwnomfipo.  uopontanQ  on  mo  oompitxny  of  mo 
problam.  it  can  toka  many  yearn  and  an  extensive  research  program  to  devalop 
tia  raiatonahipa.  Ofian,  inataad.  a  rwmbar  of  simpitying  aasumpfions  are  mode 
end  an  approximate  model  is  developed.  That  approach,  whfie  being  an 
expedMoue  matiod  tor  developing  an  approxtoiata  model  tiat  could  be  useful, 
provides  only  a  tiaoreticai  approidmtfon  tatt  may  not  ba  valid  tor  tie  actual 
problem  appficatton.  With  tie  exMlng  modem  computeri  it  to  eleo  frequently 
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poostbto  to  develop  computer  based  nimnerical  solutions.  However  the 
generation  of  solutions  usir^  nunericai  analysis  on-line  are  often  Ux>  time- 
consi^ng  to  be  useful  for  real  time  aruUysts. 

The  ANS,  v^ien  a  network  can  be  found  to  solve  the  problem,  provides  an 
accurate  model  of  die  process  or  sigr^.  Neural  networks  are  empiricaily  based 
systems  that  when  presented  with  a  specific  input  pattern,  can  associate 
specific  output  patterns.  It  is,  essentolty,  a  highly  complex,  non-linear, 
malhemidicai  relationship  or  transform.  However,  it  is  not  necessary  for  the 
developer  of  such  a  system  to  urKlerstarid  the  basic  undertying  principles  of  a 
process  in  order  to  develop  a  highly  acojrate  ANS  based  nxxJel  of  the  process. 
Thus,  k!  thM  way  it  is  quite  different  from  c^her  mathematical  modeling  or  signal 
processing  approaches. 

Saeic  Mic^plee 

The  problem  of  an  ANS  is  to  decide  how  many  rxxfes  arxj  connections  are 
needed  to  model  a  ^>ecHfc  problem:  to  decide  how  to  configure  them;  arvl  to 
decide  the  specific  values  the  connection  weights  arxt  the  transfer  fur^tions 
that  exist  within  the  network.  Figure  1  represents  a  schematic  diagram  of  a 
neural  network  and  Figure  2  »  a  reprMerttaSon  of  the  weights,  transfer 
functions,  and  the  mechanisms  of  network  operation.  There  is  no  direct  known 
correspondence  between  the  network  pararrteters.  its  operation,  and  the  problem 
to  be  modeled  by  tie  network.  As  a  corwequsrx;e.  ttiere  are  currentty  a  lack  of 
good  mechanisms  tfiat  can  be  used  lo  dke^  assign  the  weights  and  transfer 
functions  m  the  network  to  that  it  can  solve  a  problem.  The  methodology  of 
fMng  a  proper  net  oonllgundlon  and  weights  to  model  a  given  problem  is  to 
step  through  a  series  of  searches  to  ftod  the  optimum  solulton.  This  process  is 
cafied  leaminQ.*  There  are  many  learning  algorithms  that  have  been 
developed*.  They  ai  have  some  achrantoges  and  restrictions,  in  the  modeling 
and  signal  proceMing  areas,  tfie  back-propagation  method  is  the  most  popular. 
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Back  Propagation 

The  back  propagation  method  assumes  that  the  search  in  weight  space  for  an 
optimum,  or  near  optimum,  network  configuration  can  be  accomplished  as  an 
iterative  search  using  the  error  gradient,  i.e..  slope  of  the  error  surface  in  La 
space^.  That  is,  a  series  of  moves  are  accomplished  on  the  multiple- 
dimensional  error  surface  using  approximately  the  maximum  mean  squared  error 
gradient  direction  as  the  move  direction  at  each  iteration.  The  error,  also  called 
the  delta,  in  the  network  is  defined  as  the  mean  squared  error  between  the 
desired  output  representation  and  the  actual  output  given  the  current  weight 
matrix  values  of  the  current  net.  By  calculating  the  maximum  gradient  of  the 
delta  for  any  given  training  example  (set  of  input  and  corresponding  output 
patterns),  the  weights  are  adjusted  so  that  the  net  moves  along  that  gradient 
direction  in  each  presentation  of  the  training  example  to  the  network^.  Using  this 
procedure,  the  network  slowly  “learns”  to  associate  all  of  the  training  example 
input  patterns  with  the  correct  corresponding  output  patterns  by  finding  a  global 
minimum  on  the  error  surface.  Since  the  primary  driving  force  in  the  back- 
propagation  method  is  the  mean  squared  error  -  delta,  it  also  called  the  “delta 
rule.’ 

This  “basic”  back  propagation  learning  process  has  several  significant 
drawb«:ks.  First,  the  optimum  configuration,  i.e.,  number  and  relative  location  of 
hidden  representation  units  or  nodes  cannot  be  pre-determined  and,  yet,  needs 
to  be  pre-assigned  by  using  an  “educated  guess”  in  order  to  use  this  procedure. 
Since  the  node  configuration  can  significantly  affect  the  operation  of  the 
network,  this  vi  i  at  best  le^  to  a  long  series  of  re-tries  and,  at  worse,  to  no 
usefd  network  at  all.  Second,  this  process  is  very  slow  and  the  learning  rate  (a 
constant  to  define  the  search  step  size)  is  set  arbitrarily,  traditionally  at  a  value 
between  zero  and  one.  Even  thixigh  research  is  being  conducted,  no  current 
krK)wn  method  for  predetermining  the  learning  rate  (gain  term)  will  consistently 
choose  an  optimum  value,  and  the  optfrrxxn  value  is  significantly  influenced  by 
the  specific- problem  being  presented  to  the  network.  Third,  it  has  been  shown 


*U.  ipece  Is  (Mined  M 

U«(R".dm) 

hsrt  R  Is  a  real  number  set.  twrafocs  R"  It  Ihs  vector  set. 

R"  ■  {( *1. •••*«)  I XI  e  R) 

The  dMancs  function  dm.  ledsllnedae 

<M(xi.jia.  ••XB).(y1.y^•••.yn))-((^-yt/)”♦(nc-y2/)%...•K/x^^^ 
where  l<»m<».  K  cen  be  SMNy  proven  that  tehan  m  approach  Infinity. 

L»»mex/xryy 

MethemeflcaMy,  the  Un  apace  ia  a  matrix  apace.  Moat  common  used  Lm  spacM  are:  Lt,  L2,  and 

U. 

m  tea  appicetion  La  apeoe  la  uaed  and  n  ia  equal  to  tfie  nur.  er  of  desired  outputs.  The 
dMance  fUrtoSon  (%  ia  used  to  deiarmina  the  error  or  error  gradtont-the  dMerance  to  desired  and 
ectuei  outpida.  The  error  datarmtoed  by  da  ia  aiao  celad  the  mam  square  error. 
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that  it  generatty  requires  a  larger  network  to  learn*  a  prc^em  than  is  required  to 
solve  the  problem.  Considering  the  many  studies.  It  is  g^erally  known  that  the 
net  could  be  trimmed  to  a  smaller  size.  However,  there  is  no  known  method  of 
reducing  the  size  of  the  network  optimally  after  Gaining  to  optimize  the  net 
performance.  Finally,  learning  instabilities  exist  in  many  problems  that  will 
cause  the  network  to  stop  learning  (converging).  One  of  these  instability  types 
is  known  as  a  local  minimum.  A  local  minimum  is  a  depression  in  the  error 
surface,  but  one  that  is  not  the  optimum  mininuim  or  lowest  error  pr^tion.  In  its 
search  routine,  the  network  algorithm  may  fall  into  a  local  minimum  and  since  the 
error  gradient  is  toward  the  local  minimum  from  all  directions,  may  not  be  able  to 
exit  from  it. 

Th€OpHml29d  Entropy  Notwork 

A  proprietary  method  developed  at  AWi  for  training  neural  networks  has  been 
shown  to  overcome  all  of  the  Imown  problems  with  the  back>pmpagation 
method,  while  maintaining  the  inherent  stability  arxj  Imown  network  development 
capabilities  of  that  method.  This  r>etwork  was  developed  through  the  use  of  a 
therrrxxtynamic  model  of  the  network  operation  that  irwluded  both  the  delta 
energy,  the  kinetics  of  the  network  aid  optimization  functic.is.  The  ^hnique, 
known  as  the  Optimized  Entropy  Network  (0*E  Net),  has  been  used  on  several 
applications  rarvging  from  high  speed  signal  processing  to  vision  systems. 

At  the  Arrwrican  Welding  Institute  researchers  have  studied  the  learning 
behaviors  of  artificial  neural  networks  for  several  years.  Based  on  the 
thermodynamic  and  kinetic  model  of  the  learrting.  it  was  discovered  that  the 
learning  is  r)ot  only  driven  by  the  delta,  but  also  by  another  irT^x>rtant  f8K:tor— the 
entropy,  which  is  defined  as  the  driving  forces  of  the  net.  The  entropy  is  a 
kinetic  measuremerrt  about  how  weQ  the  neural  network  is  learning  at  certain 
stages  of  the  training.  Sometimes,  even  if  the  delta  is  large  but  entropy  is  low. 
the  system  will  not  team.  This  is  tfie  case  when  the  system  reaches  a  local 
minimum  or  other  kind  of  learning  instability.  Additionalty.  the  mathematical 
functions  used  for  the  training  of  the  ANS  have  been  optimized  as  well.  The 
optimized  entropy  algorithm  will  monitor  the  entropy  dos^  durirrg  the  learning, 
maintaining  a  reasonable  entropy  whie  pushing  the  learning  rate  as  high  as 
possible  using  the  optimized  mathematfcal  functions. 

The  O-E  Net  has  achieved  learning  rates  greater  than  1000  times  that  of  the 
standard  back  propagation  method  for  complex  problems  while  also  preventing 
the  network  from  falling  Into  learning  instabilities.  Research  conducted  by  the 
AJI  researchers  has  confirmed  the  existsnce  of  at  le^  three  types  of  learning 
instabilities  (local  minimum,  trough  and  flat  surface)  and  the  0>E  Net  algorithm 
can  avoid  all  three  instabilities.  In  addHfon,  using  entropy  as  the  criticai  driving 
force,  the  system  configures  itself  dynamically  during  the  learning  process  and 
so  it  can  produce  a  near  optimum  network  size  for  operation,  often  much  smaller 
than  the  network  needed  to  *leam*  the  problem. 
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Dev«iopiiMfital  Procaduras 

The  ste(»  and  processes  used  in  the  development  of  the  ANS  for  ttiis  project  are 

I  descriM  in  detail  in  the  following  sections. 

[ 

I 

Impimnentatlon  Scheme 

For  this  feasibility  study,  it  was  decided  to  model  the  submarine  trajectory  over  a 
1 000  second  time  frame.  There  are  four  measurements  related  to  the  submarine 
trajectory:  Depth  (in  feet).  Pitch  (in  degrees).  Heading  (in  degrees).  aiKi  Roll  (in 
degrees).  The  control  variables  are: 

•  Thespeedof  the  submarine,  from  2  knots  to  15  toiots. 

•  The  ang^  of  the  stem  f^ane.  fmm  -15  degrees  to  -t-l  5  degrees. 

•  The  an^e  of  the  njdder  plane,  from  -15  degrees  to  •(•15  degrees. 

•  The  an^e  of  the  forward  plane,  from  -15  degrees  to  -t-l  5  degrees. 

•  The  planes'  holding  time  befc^e  returning  back  to  the  neutral  position, 
from  10  seconds  to  100  secor>ds. 


Two  sets  of  data  are  required  to  train  a  neural  network:  a  training  data  set  and  a 
testir>g  data  set.  The  trakrting  data  set  contains  the  data  which  is  presented  to  the 
neural  network  for  trakkng.  i.e.  development  of  the  ANS  structure  needed  for 
identification  of  the  desired  features  in  the  data  set.  The  testing  data  is  used  to 
test  the  performance  and  accuracy  of  the  trained  neural  networks.  To  verify  the 
accuracy  ol  the  fkial  ANS.  tie  test  data  set  contains  ail  combinations  of  the 
foMowiog  data  points,  also  known  as  via  points: 

•  Speed:  2. 2.5.  3.  3.5, 4. 5. 6. 7, 8. 9. 10. 11. 12. 13,  and  15  knots. 

•  Stem  plane:  -15,  -12.  -9,  -6,  -3. 0,  <f3,  >8.  •»•9.  •f12,  and  •f15  degrees. 

•  Rudder  plane:  *15,  -12.  -9.  -6.  -3. 0,  -fS.  *9,  and  -»•15  degrees. 

•  Forward  plane:  -15,  -12.  -9,  -6,  -3, 0,  •►3,  +9,  +12,  and  +15  degrees. 

•  Holding  Time:  10. 20. 35. 50. 65, 80,  and  100  sec. 

Therefore  there  sue  a  total  of  15x1 1x1 1x1 1x7s 139,755  test  data  samples. 

The  O-E  algorithm  wW  automaticafly  determine  the  training  samples  as 
necesaary  during  tie  training  process.  The  final  training  samples  are  listed  in 
Table  2  in  the  Training  section  arxi  Appendix  B.  Both  test  and  training  data 
samples  are  produced  by  the  simulation  program  supplied  by  Dr.  John. 

Experience  has  lead  AWI  to  develop  rrx^ple-oulput  networks  by  training  a  net 
for  each  trajectory  measurement  individually  and  then  combining  each  net  Into 
one  final  ANS.  Arftimeticaity  there  are  no  cRfforences,  only  savings  in  the  total 
Mnkig  time  that  would  (Mherwise  be  needed.  Another  advantage  is  the  ability 
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to  easily  add  new  types  of  output  at  a  later  time.  To  further  reduce  the  required 
training  time  arKi  size  of  the  net.  each  individual  neural  network  works  only 
within  a  certain  speed  range:  low  speed  ( 2  to  5  knots),  medium  speed  (5  to  10 
knots),  or  high  speed  (10  to  15  knots).  Therefore,  there  are  a  total  of  12 
individual  neural  networks  which  have  been  developed.  Table  1  lists  the  names 
of  each  individual  net  and  its  working  domain. 


Table  1.  Neural  Networks  Developed  For  this  Project 


Depth 

Pitch 

Heading 

Roil 

Low  Speed  (2-5  Knots) 

SUB11.NET 

SUB12.NET 

SUB13.NET 

SUB14.NET 

Medium  Speed  (5*10  Krxxs) 

SUB21.NET 

SUB22.NET 

SUB23.NET 

SUB24.NET 

High  Speed  (10*15  KtkXs) 

SUB31.NET 

SUB32.NET 

SUB33.NET 

SUB34.NET 

Although  12  individual  neural  networks  are  contained  in  the  system,  they  are 
fully  integrated  together  as  a  whole  through  the  mntime  user  interface.  Hence 
the  user  or)iy  “sees”  one  ANS. 

Qnphicat  Mtifac* 

Two  graphical  user  interfaces  were  developed  for  this  project.  One  of  the  user 
interfaces  was  or^  used  for  training  and  testing  the  individual  nets.  Another 
graphical  user  interfSK:e  (GUI)  was  developed  for  use  with  the  runtime  ANS, 
which  combines  all  twehra  nets  into  one.  Both  interfaces  were  developed  using 
tfie  programming  language.  They  ate  menu  driven  graphic  interfaces  which 
use  an  AW1  proprietary  GUI  mcunagement  tool  called  ’VflN.”  Both  interfaces  can 
be  run  on  either  an  IBM  RS6000  UNIX  workstation  or  a  DOS  based  IBM 
personal  computer.  To  reduce  the  training  time,  most  of  training  and  testing  was 
done  on  an  RS6000  workstation. 

and  Training  intarfaca 

The  input  to  one  individual  network  consists  of  the  speed,  stem  plane  angle, 
rudder  angle,  forward  plane  angle  and  the  hold  time  for  the  control  surfaces. 
There  are  a  total  of  78  output  nodes  for  each  itxfividual  neural  network  which  is 
logarithmically  distributed  in  the  1000  second  time  frame  (Figure  3).  Obviously, 
the  manner  of  choosing  the  proper  number  of  output  data  points  and  their 
distribution  has  lot  to  do  with  the  accuracy  and  training  time.  Several  schemes 
were  tested,  including  the  exponential  distribution,  linear  distribution,  and 
logarithmic  distribution  combined  with  different  numbers  of  output  nodes  from  50 
to  100.  The  78  output  nodes  with  a  logarithmic  distribution  is  the  best 
combination  based  on  experience  and  is  chosen  as  the  final  configuration. 
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The  training  GUI  developed  by  AWI  is  a  generic  user-friendly  interface  (Figure 
4).  This  interface  is  written  in  the  Oh-  programming  ianguage  and  can  run  on  an 
IBM-or  compatible  PC  and  an  RS  6000  workstation  without  code  modification. 
Using  this  interftee,  the  training  and  testing  can  be  done  in  an  "automatic”  or 
*manuar  manner.  The  training  files  and  training  data  are  easily  accessible 
through  the  GUI.  The  user  can  easily  adjust  the  training  parameters  manually, 
such  as  the  learning  rate,  net  configuration,  input/output  distnbution  functions, 
and  weights. 
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This  interface  also  allows  the  generation  of  the  training  and  testing  samples 
using  the  numerical  simulation  program,  which  is  a  non-GUI  program.  The  data 
can  be  gerterated  rarKkxnly  or  automatically.  For  random  generation,  the  user 
can  type  in  variables  such  as  the  time  and  angle  of  each  control  plane  through 
the  GUI  (Figure  5),  then  the  GUI  will  cause  the  simulation  program  to  calculate 
the  results.  The  results  can  then  be  displayed  graphically  on  the  screen  (Figure 
6).  For  automatic  data  generation,  the  user  can  specify  the  via  points  of  each 
variable,  then  the  GUI  will  automaticaliy  generate  ail  samples  on  the  grid  and 
store  them  on  the  hard  disk.  Another  button  on  the  GUI  allows  the  user  to 
convert  this  data  into  a  format  which  c£ui  be  directly  used  for  training  and  testing 
purposes  (see  Rgure  6). 
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Figures  Sample  File  Generation 
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Figure  6  The  Sample  Generated  by  the  Numerical  Computation 


As  previously  mentioned,  AWI  has  trained  twelve  individual  nets,  each  of  which 
is  responsible  for  performing  one  measurement  of  the  submarine  trajectory  in 
one  speed  range.  For  each  net,  there  is  a  job  file  and  a  training  file  associated 
with  the  training  process.  Therefore,  for  the  problem  discussed  herein,  there  are 
twelve  job  files  and  twelve  training  files.  The  job  file  contains  ttie  general 
information  of  the  trained  net,  such  as  the  number  of  input  and  output  nodes, 
and  the  distribution  scheme  of  the  output  nodes.  The  training  file  contains  all 
training  and  testing  file  names.  Each  time,  the  GUI  can  only  train  and  test  a 
single  job.  By  using  the  "Load  Job”  button,  various  jobs  can  be  loaded  into  the 
GUI  for  training  and  testing. 

Trial  Training 

As  previously  discussed,  two  very  important  factors  must  first  be  determined, 
which  are  the  number  of  output  nodes  and  the  distribution  scheme  of  these 
nodes  in  the  1000  second  time  frame.  Using  the  training  interface,  a  series  of 
rough  trial  training  runs  are  conducted  to  determine  the  optimum  number  of 
output  nodes  and  the  distribution  scheme.  The  objective  of  a  trial  training  is  to 
train  the  net  to  within  a  20%  error  level  with  various  combination  of  output  nodes 
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and  distribution  schemes.  The  testing  range  for  the  number  of  output  nodes  is 
from  50  to  100.  The  distribution  functions  used  in  the  test  are  the  linear, 
exponential,  and  logarithmic.  The  optimum  combination  is  determined  based  on 
the  number  of  training  cycles  and  the  overall  time  needed  for  each  trial  net  to 
converge.  Based  on  the  results  from  these  trial  trainings,  a  78  output  node 
network  using  logarithmic  distribution  was  selected. 

Training 

As  previously  identified,  the  training  was  done  individually  for  each 
measurement  of  the  trajectory  within  certain  speed  range.  Each  individual 
training  set  was  done  in  the  following  manner.  Initial  training  of  the  net  was 
performed  with  10  randomly  selected  training  samples.  After  the  net  converges, 
it  is  tested  on  all  139.755  testing  samples.  If  the  result  is  not  satisfactory,  the  E- 
O  algorithm  will  suggest  the  next  training  sample  which  needs  to  be  added. 
Using  the  numerical  simulation  program,  the  designated  sample  will  be 
gerterated  arto  added  into  the  training  set  automatically.  The  net  will  be  retrained 
using  this  new  training  sample.  This  procedure  will  be  repeated  many  times  until 
the  final  testing  result  is  satisfactory. 

Through  the  training  interface,  the  above  procedure  can  be  done  automatically. 
Table  2  summarizes  the  number  of  training  samples  used  by  each  individual  net 
and  Appendix  B  lists  detailed  information  of  all  training  samples  used  in  the 
trainir^. 


Table  2  Number  Of  Training  Samples  Used  By  Each  Net 


Net  Name 

Number  of  training 
SAMPLES 

SUB11.NET 

287 

SUB12.NET 

197 

SUB13.NET 

69 

SUB14.NET 

130 

SUB21.NET 

175 

SUB22.NET 

255 

SUB23.NET 

386 

SUB24.NET 

254 

SUB31.NET 

301 

SUB32.NET 

248 

SUB33.NET 

485 

SUB34.NET 

183 

Even  though  there  are  12  individual  ANSs,  there  are  some  similarities.  For 
example,  the  number  of  input  nodes  and  output  nodes  tor  each  ANS  is  the  same. 
The  differences  lie  in  the  number  of  hidden  layers  and  nodes  within  those 
hidden  layers.  The  optimization  procedures  of  the  Optimized  Entropy  rule  yield 
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individual  representations.  Rgures  7  shows  schematic  representations  of  two 
ANSs  thus  configured.  The  SUB12  has  S  input  nodes,  78  output  nodes,  and  two 
hidden  layers  of  twelve  and  fourteen  nodes  respectiveiy.  The  SUB14  has  the 
same  input  and  output  nodes,  yet  it  consists  of  two  hidden  layers  of  ten  and 
twelve  nodes  respectively.  This  latter  ANS  structure  is  designated  as 
5x12x1 1x78.  Table  3  summarizes  the  net  configuration  for  each  individual  neural 
network. 


(7a)  SUB12.NET 
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(7b)  SUB14.NET 

Figure  7  The  neural  network  configuration  for  SUB12.NET  and  SUB14.NET 


Table  3  Net  Configuration  for  Each  Net 


IWI  IWflv 


SUB11.NET 


SUB12.NET 


SUB13.NET 


SUB14.NET 


SUB21.NET 


SUB22.NET 


SUB23.NET 


SUB24.NET 


SUB31.NET 


SUB32.NET 


SUB33.NET 


SUB34.NET 


Summa 


5x12x15x78 


5x12x14x78 


5x12x14x78 


5x10x12x78 


5x12x14x78 


5x12x12x78 


5x12x14x78 


5x12x11x78 


5x14x13x78 


5x10x11x78 


5x15x15x78 


5x11x14x78 


5x144x159x78 


Number  Of  Links 


2916 


2820 


2820 


2436 


2820 


2628 


2820 


2532 


2918 


2342 


3213 


2722 


32987 
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T99tlng 

There  are  two  types  of  testing,  namely:  automatic  and  random.  In  the  automatic 
nxxie.  the  program  will  automaticaBy  test  the  current  net  through  the  entire 
139.755  test  data  and  record  the  maximum  error,  mean  error,  and  other  useful 
statistical  results  into  a  file  (see  Appendix  A)  and/or  display  the  results  on  the 
screen  (Figure  8).  In  Figure  8  the  results  should  be  read  as  follows.  Each 
horizontal  plot  (there  are  8  such  plots  in  the  Figure)  shows  the  sample  number  in 
the  upper  left  box,  below  that  the  maximum  error  for  that  sample  aixl  below  that 
the  mean  error  tor  that  sample  when  the  computer  generated  results  and  the 
ANS  generated  results  are  compared.  Looking  at  the  plotted  values,  the  first 
five  lines  represent  the  input  parameters,  the  next  78  data  points  are  the  values 
generated  ^  CSC's  simulation  program  and  the  last  78  data  points  are  those 
determtoed  by  the  ANS  tor  that  sample.  In  the  random  testing  mode,  the  user 
can  specify  the  input  of  any  value  within  the  operating  domain,  and  the  output  of 
the  neural  network  will  then  be  compared  with  the  result  of  the  numerical 
simulation  program  and  the  output  cunres  will  be  plotted  on  the  computer  screen 
for  comparison. 


Rgure  8.  The  tasting  result  screen. 


21 


Awm-ooe 


RmUnm  QIM  tnd  Combined  Neural  Net 

The  sufTvnation  of  each  incivtdual  net  in  the  final  run>time  configuration  yields 
the  same  results  as  if  the  net  were  configured  as  a  whole.  The  real  benefits  are 
in  the  saving  of  a  gre^  (teal  of  development  time  and  effort  Figure  9  shows  the 
runtime  GUI. 


Figures  The  runtime  GUI 


Results  and  Analysis 

AWI  has  developed  an  ANS  based  submarine  simulalion  with  a  maximum  error 
over  the  entire  domain  of  the  ANS  no  larger  than  7%.  In  most  cases  the  actual 
error  is  signHIcarTtty  less.  Table  4  summarizes  these  results  for  each  ANS. 

The  maximum  error,  E  is  caicuiated  by  the  following  formula: 


ExnaxiiOrDiiyN 


Here  Oi  is  the  actual  output  of  the  i**  node  of  the  ANS  arKl  Di  is  the  desired  value 
of  the  output  rKXle.  where  i  runs  from  1  to  78.  N  is  the  dynamic  range  of  the 
output,  which  is  Rsted  in  Table  5 
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Table  4  Final  Results 


iiaii  ■ 

Hm  rimwW 

Maximum  Error  E 

SUB11.NET 

0.02 

SUB12.NET 

0.045 

SUB13.NET 

0.06 

SUB14.NET 

0.045 

SUB21.NET 

0.04 

SUB22.NET 

0.06 

SUB23.NET 

0.02 

SUB24.NET 

0.07 

SUB31.NET 

0.06 

SUB32.NET 

0.06 

SUB33.NET 

0.03 

SU834.NET 

0.06 

*RMults  an  baaed  on  tatting  the  dtvalopad  ANS  against  139.7S5  teat  data 
poirtts. 


Table  5  The  Dynamic  Ftenga.  N 


NsINmis 

N 

8UB11.NET 

340 

SUB12iCT 

34 

8UB13iCT 

200 

SUB14JCT 

4 

SUB21MET 

3000 

SUB22JCT 

100 

SUB23itET 

300 

SUB24ilET 

8 

SUB311CT 

8000 

SUB32.NET 

180 

SUB33ilET 

440 

SUB34.NEt 

32 

Tha  ayslam  can  ba  run  in  afthar  tomsard  or  inearsa  moda.  Whan  the  system  is 
running  in  tia  forward  mode,  tfta  scrsan  wM  ba  simiar  to  Rgura  10.  Moving 
siida  bars  of  each  control  variabla  causes  the  submartna  trajactory  to 
cafouialad  immadfolsly  and  graphicaiy  dfoplayad  on  the  scrsan.  Altamathr 
the  daairad  settings  for  the  control  varlabia  can  ba  input  diractty  by  highlighting 
and  than  typing  the  vaiua  in  the  box  under  the  sMa  bar. 

On  die  forward  screen,  the  control  variabla.  adtich  is  the  input  to  the  ANS  is 
shown  on  the  siida  bar  tocMI  on  the  bottom  part  of  the  screen.  These  bars  can 
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be  slid  up  or  down  using  mouse.  A  additional  slide  called  “Scale”  is  shown  on 
the  same  screen  as  well.  Moving  this  slide  can  change  the  output  plot  scale.  The 
predicted  submarine  trajectory  will  graphically  displayed  on  the  top  part  of  the 
screen,  which  are  the  depth,  pitch,  heading,  and  roll.  The  positive  depth  means 
the  submarine  goes  ^wn.  The  pitch,  heading,  and  rolling  angle  In 
counterclockwise  direction  facing  the  bow  of  the  submarine  are  positive. 


Figure  10  Running  the  system  in  fonvard  mode  on  the  runtime  GUI 

When  the  system  runs  in  the  kwerse  mode,  the  screen  will  be  similar  to  Figure 
11.  The  user  can  specify  a  point  on  the  submarine  trajectory  at  some  time  in  the 
future.  For  exam^.  the  user  can  rec^ire  t^»at  the  si^i^rine  to  be  at  a  depth  of 
1500  feet,  a  hea^ng  of  -30  de^^ee.  a  pitch  of  0  degrees  and  at  a  roll  angle  of  0 
degrees  relative  to  current  position  at  a  point  in  time  800  seconds  later.  Then  the 
system  will  recommerxi  the  proper  values  for  each  control  vanable  in  order  to 
lead  the  subnwir^  to  pass  the  desired  point  800  seconds  later  as  closely  as 
possible  (see  Figure  12)  assuming  the  user  did  not  request  an  impossible 
condition.  If  the  user  dkf  ask  for  the  impossible,  then  the  ANS  still  tries  to  match 
the  request  as  closely  as  possible. 
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Figure  1 1  Rimnlno  the  system  in  the  inverse  mode  on  the  runtime  GUI 


Figure  i2.  The  inverse  result. 
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When  the  system  runs  in  the  inverse  mode,  the  control  variables  can  be 
weighted  in  a  range  from  0  to  5.  This  weight  represents  the  freedom  to  change  a 
particular  parameter.  Weight  0  means  no  freedom  at  all,  which  simply  means 
that  this  control  variable  cannot  be  changed.  For  example,  if  the  speed  is  12 
knots  and  the  weight  for  the  speed  is  0.  the  algorithm  must  keep  the  speed 
constant  at  12  krrots  while  attempting  to  vary  other  control  variables,  such  as  the 
angle  of  the  stem  plane,  mdder.  and  forward  plane,  and  hold  time,  so  that  the 
submarine  can  pass  the  required  point  at  the  required  time  as  closely  as 
possible.  Weight  1  means  this  variable  has  least  freedom,  while  weight  5  means 
this  variable  has  the  greatest  freedom  of  change.  The  algorithm  will  attempt  to 
adjust  the  variables  which  have  the  highest  degree  of  freedom  first,  and  the 
variable(s)  with  lowest  degree  of  freedom  last  if  necessary. 

Each  irKlividual  measurem^  of  the  trajectory  also  has  a  weight  associated  with 
it  when  the  system  is  runnirtg  in  the  inverse  mode.  This  weight  controls  how 
closely  this  measurement  has  to  be  matched.  Weight  0  means  it  is  not  important 
at  all  while  5  indicates  that  it  is  the  most  important 

Running  a  neural  network  in  the  inverse  mode  represents  a  search  problem 
which  can  be  approached  with  a  thorough  search  or  a  gradient  search.  The 
thorough  search  guarantees  that  the  best  possible  answer  will  be  found,  but  it  is 
also  very  tkne  consuming.  The  gradient  search  can  save  a  great  deal  of  time, 
but  the  local  minimum  (or  local  maximum)  problem  can  occur  and  prevent  the 
system  from  continuing  to  find  the  best  solution.  The  O-E  algorithm  used  in  our 
ANS  training  is  adapted  to  inverse  searching  and  speeds  up  the  search  while  it 
guarantees  that  it  wW  find  the  giobai  minimum  solution.  The  entire  search 
process  only  takes  about  4  seconds  on  a  90  MHz  Pentium  PC. 

Conclusions 

AWI  has  successfully  demonstrated  the  feasibility  of  modeling  the  submarine 
tn^ectory  using  ANS.  Several  advantages  have  been  revealed  through  this 
study: 


•  Empirical  based  model  can  be  developed  without  the  use  of  human 
exp^. 

•  The  ANS  model  can  be  developed  at  a  significantly  lower  cost  than 
tracfttional  modeling  approaches. 

•  Requires  less  computing  power  to  run  the  finished  model  which  enables 
the  forward  system  to  run  in  real  time  on  a  conventional  PC. 

•  Near  real  time  inverse  ntodel  has  been  developed,  which  can  be  used  in 
a  crew  tratoing  system  or  an  automatic  control  system.  Hie  inverse 
model  can  be  developed  to  run  in  real  time,  although  the  current  ANS 
takes  4  seconds  to  determine  input  parameters  on  a  Pentium  based  PC. 
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Future  Work 

Several  possible  future  steps  are  suggested: 

•  Apply  AWI's  proprietary  ANS  approach  with  real  data  instead  of  simulated 
data. 

•  Using  a  multiple  processor  workstation,  train  the  ANS  to  a  much  higher 
accuracy. 

•  Apply  the  ANS  as  a  training  tool  for  submarine. 
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Appendix  A  Test  Results 

Testing  Result  File  (Partial) 
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Structure:5x1 0x1 1  x78 
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Appendix  B  The  Training  Data  Set 

Note:  The  Speed  is  in  feet.  Stem,  rudder,  and  forward  piane  angie  is  in 
degree,  and  the  Hoid-time  is  in  1/10  of  seconds. 
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